Topological Bias in Distance-Based Phylogenetic Methods: Problems with Over- and Underestimated Genetic Distances
Abstract
I show several types of topological bias in distance-based methods that use the least-squares method to evaluate branch lengths and the minimum evolution (ME) or Fitch-Margoliash (FM) criterion to choose the best tree. For a 6-species tree, there are two tree shapes: one with three cherries (a cherry is a pair of adjacent leaves descending from their most recent common ancestor) and the other with two. When genetic distances are underestimated, the 3-cherry tree shape is favored under either the ME or the FM criterion. When genetic distances are overestimated, the ME criterion favors the 2-cherry tree shape, but the direction of bias under the FM criterion depends on whether negative branches are allowed: allowing negative branches favors the 3-cherry tree shape, whereas disallowing them favors the 2-cherry tree shape. The extent of the bias is explored by computer simulation of sequence evolution.
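The two 6-species tree shapes mentioned in the abstract can be illustrated with a minimal Python sketch that counts cherries in an unrooted tree. The edge-list representation, the node labels, and the function name `count_cherries` are assumptions made for illustration and are not taken from the paper.

```python
from collections import defaultdict


def count_cherries(edges):
    """Count cherries in an unrooted binary tree given as (u, v) edges.

    A cherry is counted as an internal node with exactly two leaf
    neighbors. This assumes a binary tree with at least four leaves.
    """
    neighbors = defaultdict(set)
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)

    # Leaves are the nodes attached to exactly one other node.
    leaves = {node for node, adj in neighbors.items() if len(adj) == 1}

    cherries = 0
    for node, adj in neighbors.items():
        if node not in leaves and len(adj & leaves) == 2:
            cherries += 1
    return cherries


# Hypothetical labels: A-F are species, i1-i4 and c are internal nodes.
# Caterpillar shape: internal nodes form a path, cherries only at the ends.
TWO_CHERRY = [
    ("A", "i1"), ("B", "i1"), ("i1", "i2"), ("C", "i2"),
    ("i2", "i3"), ("D", "i3"), ("i3", "i4"), ("E", "i4"), ("F", "i4"),
]

# Symmetric shape: a central node joined to three cherries.
THREE_CHERRY = [
    ("A", "i1"), ("B", "i1"), ("C", "i2"), ("D", "i2"),
    ("E", "i3"), ("F", "i3"), ("i1", "c"), ("i2", "c"), ("i3", "c"),
]

print(count_cherries(TWO_CHERRY))    # 2
print(count_cherries(THREE_CHERRY))  # 3
```

The sketch only classifies tree shape; it does not estimate branch lengths or evaluate the ME or FM criterion.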
Similar resources
Study on Phylogenetic Relationship among some of Iranian Wild Almond Species using Sequences of ITS1-5.8S rDNA-ITS2 Region and Chloroplastic trnL
Phylogenetic relations among 12 wild species of almonds, one cultivated almond and one species of peach were investigated using ITS1-5.8S rDNA-ITS2 sequences and the trnL region of chloroplast DNA. To do this, maximum-parsimony and neighbor-joining analyses were adopted. Results of the ITS data showed that the studied species of Prunus divided into only two groups but incapable to separate different section...
Towards Distance-Based Phylogenetic Inference in Average-Case Linear-Time
Computing genetic evolution distances among a set of taxa dominates the running time of many phylogenetic inference methods. Most genetic evolution distance definitions rely, even if indirectly, on computing the pairwise Hamming distance among sequences or profiles. We propose here an average-case linear-time algorithm to compute pairwise Hamming distances among a set of taxa under a given H...
Molecular Characterization and Phylogeny Analysis Based on Sequences of Cytochrome Oxidase gene From Hemiscorpius lepturus of Iran
Background: Hemiscorpius lepturus is a medically important scorpion found along the Iranian borders, especially near Khuzestan Province in the south-west of Iran. It is the only non-buthid scorpion that is potentially lethal in southern Iran and is responsible for severe dermonecrotic scorpionism. Objectives: In this study, DNA fragment of the mitochondrial cytochrome c oxidase ...
Taxonomic status of six populations of the Gobiids (Teleost, Gobiidae) in the southern Caspian Sea basin using COI gene
Morphological similarities among the populations and species of the genus Ponticola in the southern Caspian Sea make their identification difficult using the available keys. Molecular markers are used as supplementary tools to better understand the fishes' taxonomic status. This study was conducted to understand the taxonomic status of gobiids from 6 rivers (Siah, Zaringol, Babol, Tajan, S...
An Adaptive LEACH-based Clustering Algorithm for Wireless Sensor Networks
LEACH is the most popular clustering algorithm in Wireless Sensor Networks (WSNs). However, it has two main drawbacks: random selection of cluster heads, and direct communication of cluster heads with the sink. This paper aims to introduce a new centralized cluster-based routing protocol named LEACH-AEC (LEACH with Adaptive Energy Consumption), which guarantees to generate balanced cl...